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Abstract 

Raghavendra (STOC 2008) gave an elegant and surprising result: if Khot's Unique Games 
Conjecture (STOC 2002) is true, then for every constraint satisfaction problem (CSP), the best 
approximation ratio is attained by a certain simple semidefinite programming and a rounding 
scheme for it. 

In this paper, we show that similar results hold for constant-time approximation algorithms 
in the bounded-degree model. Specifically, we present the followings: (i) For every CSP, we 
construct an oracle that serves an access, in constant time, to a nearly optimal solution to a 
basic LP relaxation of the CSP. (ii) Using the oracle, we give a constant-time rounding scheme 
that achieves an approximation ratio coincident with the integrality gap of the basic LP. (iii) 
Finally, we give a generic conversion from integrality gaps of basic LPs to hardness results. All 
of those results are unconditional. Therefore, for every bounded-degree CSP, we give the best 
constant-time approximation algorithm among all. 

A CSP instance is called e-far from satisfiability if we must remove at least an e-fraction of 
constraints to make it satisfiable. A CSP is called testable if there is a constant-time algorithm 
that distinguishes satisfiable instances from e-far instances with probability at least 2/3. Using 
the results above, we also derive, under a technical assumption, an equivalent condition under 
which a CSP is testable in the bounded-degree model. 

Key words: Constant-time approximation, constraint satisfaction problems, linear program- 
mings, rounding schemes, property testing. 
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1 Introduction 



In a constraint satisfaction problem (CSP), the objective is to find an assignment to a set of variables 
that satisfies the maximum number of a given set of constraints on them. Formally, a CSP A is 
specified by a set of predicates over alphabets [q] = {!,... ,q}. Every instance of A consists of a 
set of variables V, and a set of constraints V on them. Each constraint consists of a predicate from 
A applied to a subset of variables. The objective is to find an assignment /3 G [q]^ to the variables 
that satisfies the maximum number of constraints. A large number of fundamental optimization 
problems, such as Max Cut and Max /c-Sat, are examples of CSPs. 

Approximation algorithms for CSPs have been intensively studied. Goemans and Williamson [9] 
first exploited semidefinite programmings (SDP) to Max Cut and Max 2SAT achieving the approx- 
imation ratio ~ 0.878. After this breakthrough, plethora of approximation algorithms using SDPs 
have been developed [151 |2T]. For inapproximability side, tight hardness results have been success- 
fully obtained for some important optimization problems such as Max 3SAT [H]. However, the 
approximability of many interesting CSPs such as Max Cut and Max 2SAT remains open. Towards 
tightening this gap, Khot [16] introduced the Unique Games Conjecture (UGC). Assuming the 
UGC, tight hardness have been shown for Max Cut [IT], Max 2SAT [3], and Max k-CSP [5t l27]. 
Finally, Raghavendra [25] succeeded to unify and generalize those approximation and inapproxima- 
bility results for every CSP. Specifically, Raghavendra showed that, assuming the UGC, for every 
CSP, a certain SDP combined with a certain rounding scheme attains the best approximation ra- 
tio among all polynomial-time approximation algorithms. The ingenious technique in the proof is 
giving a generic conversion from integrality gaps of SDPs to hardness results via the UGC. 

In this paper, we are concerned with constant-time approximation algorithms CSPs. That is, 
algorithms are supposed to run in time independent of sizes of instances. We use the bounded- degree 
model, which was originally introduced for graphs [11]. In this model, the number of alphabets, 
the maximum arity (the number of inputs to a predicate), the maximum degree (the number of 
constraints where a variable appears), and the maximum weight of constraints are bounded by 
constants. Let X be a A-CSP instance. Since a constant-time algorithm cannot read the whole I, 
we assume the existence of an oracle Ox with which we can get information of I. By specifying a 
variable v and an index i, Ox returns a constraint P where P is the i-th constraint where v appears. 
The efficiency of an algorithm is measured by the number of accesses to Oj, which is called query 
complexity. 

In this paper, we show an analogous result to Raghavendra's result: for every CSP, a certain 
linear programming (LP) combined with a certain rounding scheme attains the best approximation 
ratio among all constant-time approximation algorithms. Furthermore, our results are uncondi- 
tional. To give the statements precisely, we need to define several notions. For a A-CSP instance X 
with the variable set V and the constraint set V, there is a natural generic LP relaxation which we 
call BasicLP (see Section[2]). Let lp(2^) denote the objective value of an optimal solution to BasicLP 
for X, opt(X) denote the value of an optimal solution of X, and val(X, /3) denote the value obtained 
by an assignment /3 S [q^ ■ We define wx as the sum of weights of constraints in X. Then, we 
define lp(X) = lp(X)/i(;x, opt(X) = OY>t[X)/wx and val(X, /3) = val(X, /3)/iux. The integrality gap 
curve S\{c) and the integrality gap a\ of a CSP A is defined as 

Sk{c) = i_nf opt(X), a A = inf opt(X)/lp(X). 

2eA,ip(X)>c 2:eA 

The first result of this paper gives a tight approximation algorithm for every CSP. 

Theorem 1.1. In the bounded- degree model, for every CSP A and e > 0, there exists an algorithm 
that, given a A-CSP instance X with n variables and lp(X) = c G (0,1], with probability at least 
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2/3, outputs a value x such that S\{c — e)wx — en < x < opt(X). Also, for some fixed assignment 
/3 such that S\{c — e)wx — en < val(X, /3) < opt(X), given a variable v in X, it computes /3„ in 
constant time. 

The algorithm computes by rounding an LP solution to BasicLP for I. Note that, for an 
instance X with lp(X) = c, S\{c)wx is the best value we can hope for from the definition of S\{c). 
Thus, in this sense, we will give a optimal rounding scheme for BasicLP. 

We mention that the additive error en cannot be removed. To see this, suppose an instance 
X consisting of n variables and only one constraint. Then, we have to see this constraint to 
approximate opt(X) if we do not allow the additive error. However, it obviously takes Q{n) queries. 

For hardness side, we show the following. 

Theorem 1.2. In the bounded- degree model, for every CSP A,c G [0, 1] and e > 0, there exists a 
5 > such that any algorithm that, given an instance X with n variables and opt(X) = c G [0, 1], 
with probability at least 2/3, outputs a value x such that {S\{c) + e)wx — 5n < x < opt(X) requires 
fl{y/n) queries. 

Note that, using the algorithm in Theorem 11.11 given an instance X, we can distinguish the 
case opt(X) > cwx from the case opt(X) < S\{c — e)wx — en (Technically, we need that S\{c) 
is non-decreasing, but this is obvious from the definition). On the contrary. Theorem 11.21 asserts 
that we cannot distinguish the case opt(X) > cwx from the case opt(X) < (S'a(c) + e)wx — Sin. 
Thus, the algorithm given in Theorem 1 1.1 1 is not just the best among constant-time approximation 
algorithm using BasicLP, but the best among all constant-time approximation algorithms. 

A value x is called an [a, (3) -approximation to a value x* if it satisfies ax* — j3 < x < x* . An 
algorithm is called an (a, (3) -approximation algorithm for a CSP A if, given a A-CSP instance X, it 
computes an (a, /3)-approximation to opt(X) with probability at least 2/3 [23\ I24j. The following 
is an immediate corollary achieved by Theorems 11.11 and 11.21 

Corollary 1.3. In the bounded- degree model, for every CSP A and e > 0, there exists a constant- 
time {a\ — e, en) -approximation algorithm for the CSP A. On the other hand, for every CSP A 
and e > 0, there exists a 6 > such that any {a\ + e, 6n)- approximation algorithm for the CSP A 
requires i^{^/n) queries. 

Theorem 1 1 . 2 1 has much implication to property testing. A A-CSP instance X is called satisfiable 
if there is an assignment to variables that satisfies all the constraints. Also, X is called e-far from 
satisfiability if we must remove at least etwn constraints to make it satisfiable, where t, w, n is the 
maximum degree, the maximum weight, and the number of variables, respectively. An algorithm is 
called a testing algorithm for (the satisfiability of) a CSP A if, given a A-CSP instance, it accepts 
with probability at least 2/3 if the instance is satisfiable, and rejects with probability at least 2/3 if 
the instance is e-far from satisfiability. Unlike the hardness result given in [25], Theorem 11.21 holds 
also for c = 1, i.e., satisfiable instances. Using this observation, we have the following theorem. 

Theorem 1.4. In the bounded- degree model, the following holds for a CSP A. If S\{1) < 1, then 
any testing algorithm for the CSP A requires ^l{y/n) queries. //S'a(I) = 1 and S\{c) is continuous 
at c = 1, then there exists a constant-time testing algorithm for the CSP A. 

We mention that Theorem 11.41 gives an "if and only if" condition of the testability of CSPs 
when their integrality gap curves are continuous at the point one while we are not aware of any 
CSP for which the curve is not continuous at that point. 

We give two direct applications of Theorem ll.4l An instance of 2-SAT is a CNF formula where 
each constraint consists of at most two literals. It is known that ^Max 2-SAt(1) = 1/2, and it follows 
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that we need ^}{^/n) queries to test 2-SAT. On the contrary, 2-SAT is known to be testable with 
0{y/n) queries [10]. This fact imphes that the lower bound in Theorem 11.21 is almost tight. An 
instance of Horn Sat is a CNF formula where each constraint has at most one positive literal, 
From [31], it is easy to derive that S'Max Horn SAt(1) = 1 and S'Max Horn sat(c) is continuous at c = 1. 
Thus, Horn SAT is testable in constant time. 

Related Work: Subsequent to Raghavendra's work [25], under the UGC, certain SDPs and LPs 
are shown to be the best approximation algorithms for several classes of problems, such as graph 
labeling problems (including A;-Way Cut, 0-Extension, and Metric Labeling) [22], kernel clustering 
problems [18], ordering CSPs (including Maximum Acyclic Subgraph) [13j, and strict monotone CSPs 
(including Minimum Vertex Cover) [20]. 

There have been many studies on constant-time approximation algorithms in the bounded- 
degree model. For algorithmic side, mainly graph problems have been studied, e.g.. Minimum Span- 
ning Tree Minimum Vertex Cover [23l [Ml [3Qj , Maximum Matching [23l[30], Maximum Independent 
Set [1], and Minimum Dominating Set \22>\ I30j . For inapproximability results of graph problems. 
Minimum Dominating Set [1] and Maximum Independent Set [Il[29] have been considered. For CSPs, 
it is known that, for every e > 0, there exists (5 > such that any (1/2 -|- e, (5n)-approximation 
algorithm for Max E2LIN2 and (7/8 -|- e, (5n)-approximation algorithm for Max E3SAT require linear 
number of queries [6]. 

We can compute the optimal value of a CSP instance within an additive error 0{en^) by sampling 
poly(l/e) variables and by solving the induced problem, where n is the number of variables and s 
is the maximum arity [21 [3]. Thus, it is easy to approximate the solution of a dense instance in 
constant time. Hence, we are concerned with the bounded-degree model in this paper. 

Proof Overview: We describe a proof sketch of Theorem 11.11 Let 0% be the oracle access to a 
A-CSP instance X. First, we construct an oracle access Oip to a nearly optimal solution to BasicLP 
for X, Namely, if we specify a variable in BasicLP, Oip outputs its value by accessing Ox constant 
number of times. To this end, we use a distributed algorithm for packing/covering LP given in [19] . 
In the distributed setting, a linear programing is bound to a graph G = {V,E). Each primal 
variable Xi and each dual variable yj is associated with a vertex ^ V and Vj E V, respectively. 
There are edges between primal and dual vertices wherever the respective variables occur in the 
corresponding inequality. Thus, (ff,fj) G -E if and only if Xi occurs in the j-th inequality of the 
primal. Let G^^k denote the graph induced by vertices whose distance from v is at most k. Then, 
a distributed algorithm in k rounds works in such a way that each vertex outputs a value of the 
corresponding variable based on G^^k- In |19j . it is shown that if the matrix in the LP is "sparse," 
then there is a distributed algorithm that computes a nearly optimal solution to the LP in k rounds, 
where k is an integer determined by the sparsity of the LP. Suppose that the degree of the graph 
is bounded by A. Then, given a variable, we can compute the value of it by performing A'^ queries 
to Ox by simulating the process of the distributed algorithm. With this method, we achieve Oip. 
Though BasicLP is not a packing/covering LP, after applying several number of transformations, 
we get a packing LP that has essentially the same behavior under approximation. Technically, we 
need to show that BasicLP is robust in the sense that even if we violate each constraint by small 
amount, the optimal value does not significantly increase. We finally mention that, a predicate can 
return values in [—1, 1] in [25] while it can only return or 1 in this paper. This restriction comes 
from that we cannot transform BasicLP to a packing LP anymore if we allow negative values. 

Next, we exhibit a solution to the original instance by rounding the LP solution given by Oip. 
In [26], Raghavendra and Steurer considered a certain SDP relaxation, which we call BasicSDP, and 
showed an optimal rounding scheme for it. That is, it achieves an approximation ratio coincident 
with the integrality gap of BasicSDP. Our proof is based on their work. First, from an instance I 
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and its LP solution, we create another instance X' by merging variables of X that are close in the 
LP solution so that the number of variables in X' become constant. Though we cannot explicitly 
construct the whole X' since the number of constraints is not constant, we can enumerate variables 
in X' . Then, we perform brute force search on X' . Specifically, we estimate the value obtained by 
each assignment to variables in X' by accessing the oracle Oip. Let be the assignment for X' 
that takes the maximum among them. Note that /3'* can be unfolded to an assignment /3* for X. 
Then, with high probability, we have 5'a(c — e) — en < val(X, < opt(X). Since, from a variable 
V in X, we can get the corresponding variable in X' in constant time, we can compute /3* in constant 
time. The crucial fact used here is that the LP optimum does not change significantly after merging 
variables. 

Now, we describe a proof sketch of Theorem 11.21 Let X be a A-CSP instance such that lp(/) = 
cwx while opt(/) is arbitrarily close to S\{c)wx. Also, let (x*,^*) be the optimal LP solution to 
BasicLP for X. First, we create a distribution of instances X'°p* by blowing up variables of BasicLP. 
With high probability, an instance J generated by P^p* satisfies that opt(^) < opt(X) + e where e 
is an arbitrarily small constant. Next, using the LP solution (x* , fi*), we create another distribution 
of instances P'P, which has the property that for all generated by P^p, opt(j7) > lp(X). From 
Yao's minimax principle, by showing that any deterministic algorithm that distinguishes P°p* from 
P'P with high probability requires r2(-y/n) queries, we have the desired result. 

Organization: In Section [21 we give notations and basic technical tools used in this paper. In 
Section [31 we present an oracle access Oip to a (nearly) optimal solution to BasicLP. Section [4] is 
devoted to describe how to round the LP solution optimally and to prove Theorem II. 1[ We give 
proofs of Theorems 11.21 and 11.41 in Section [5] and Appendix [El respectively. 

2 Preliminaries 
2.1 Definitions 

For an integer k, [k] denotes the set {1, . . . , k}. The arity of a predicate P : [q]^ — )• {0, 1} is the 
number of inputs to P, i.e., k here. The degree of a variable is the number of constraints where the 
variable appears. For a constraint P in a CSP instance, V{P) denotes the set of variables in P. Let 
/? be a vector or a set indexed by elements of a set V. For a subset S* C y, we define (S^s = {l^v}v£S- 

Definition 2.1. A bounded- degree constraint satisfaction problem A is specified by A = {[q], s, t, w, P), 
where [q] is a finite domain, s is the maximum arity of predicates, t is the maximum degree of vari- 
ables, w is the maximum weight of predicates, and P = {P : [g]'^ — )• {0, 1} \ k < s} is a set of 
predicates. 

Definition 2.2. An instance X of a CSP A = {[q], s,t,w,¥) is given by X = {V,V,w), where 

• V = {vi, . . . , Vn} is a set of variables taking values over [q], 

• V is a set of constraints, consisting of predicates P G P applied to sequences S of variables V of 
size at most s. More precisely, when a predicate P is applied to a sequence 5* = {ii, . . . , ife} C 
[n]^ , P takes variables V\g = {vj^, . . . j-Uj^} as the input. 

• w is a set of weights {ifpjpgp assigned to each constraint P ^ V, where 1 < wp < w. 

The objective is to find an assignment to variables (3 G [q^ that maximizes the total weight of 
satisfied constraints, i.e., Y^p^y,wpP{fi). 
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Definition 2.3 (Bounded-degree Model). In the bounded- degree model, an algorithm is given a 
CSP A = {[q],s, t, w, F) and the set of variables V beforehand. A A-CSP instance I = {V, V, w) is 
represented by an oracle Ox such that Ox, on two numbers v £ V,i £ [t], returns P £ V where P is 
the i-th constraint where v appears. If no such constraint exists, it returns a special character _L. 
The query complexity of an algorithm is the number of accesses to Ox. 

In this paper, when there is no ambiguity, symbols q, s, t and w are used to denote the parameters 
of a considered CSP. Also, symbols n. Ox, wx are used to denote the number of variables, the oracle 
access, and the total weight of an input instance X, respectively. 

We consider an LP relaxation for a CSP A as follows, which we call BasicLP. 

max '^P E -P(/3)/^p,/3 

S.t. ^v,a = 1 "iv^V 

J2 tJ.p,3 = Xv,a yP£V,v£V{P),a£[q] 

/3e[g]^(-P),/3.=a 

> \/v £V,aG[q] 

^lp,p>Q yPeV,(3e[qf(P\ 

Here, = {xy^a}a£[q] (resp., Hp = {/^p,/3}^g[q]v{p)) can be seen as a distribution over assignments 
to a variable v G V (resp., a constraint P £ V), and we often identify them as distributions. 
For an LP solution (a;,/x), we define val(X, a;,/x) as the value of the objective function of BasicLP 
obtained by {x,fj,). We call an LP solution (ic,/x) e-infeasible if it satisfies constraints of the form 
Xv,a ^ and /^p^^ > and violates other constraints by at most e. We call a solution to an LP 
(a, 13) -approximate if the objective value obtained by the solution is an (a, /3)-approximation to the 
optimal value of the LP. 



2.2 Basic Tools 

As a simple application of Hoeffding's inequality, we obtain the following. 

Lemma 2.4. Suppose that we have an oracle access to a function / : [n] — )• [0, w]. That is, by 
specifying x G [n] as a query, we can see the value of f[x). Then, by querying O(^log^) times, 
with probability at least 1 — 6, we can compute a [1, en) -approximation to f{i). □ 

Let X be a A-CSP instance. Not surprisingly, we cannot compute the optimal solution [x* , /i*) 
of BasicLP for X in constant time. Even worse, it is also hard to obtain a feasible solution in constant 
time. Instead, we will compute a feasible (nearly) optimal solution (a;, /x) of an LP obtained by 
relaxing equality constraints. Though this is an infeasible solution in the original LP, The following 
lemma states that val(X, a;,/x) is close to lp(X). The proof, which needs Fourier analysis, is given 
in Appendix El 

Lemma 2.5 (Robustness of BasicLP). Let T be a A-CSP instance. Suppose that {x,fj.) is an 
e-infeasible LP solution for I of value cwx- Then, it holds that 

lp(X) > c-e-poly(gs). 



3 A (1 — e, en) -approximation algorithm for BasicLP 

In this section, we show the following theorem. 
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Theorem 3.1. In the bounded- degree model, given a A-CSP instance I, for any e > 0, we can con- 
struct an oracle 0\p that gives an access to an e-feasible (1 — e, en) -approximate solution to BasicLP 
fori. For each query, the number of queries performed to Ox is at most exp(exp(poly(gsttt;/e))). 

A packing LP is a problem of maximizing h'" z subject to z < c and z > 0, where A G M™^" is 
a non- negative matrix and b,c £ M" are non- negative vectors. There is a constant-round distributed 
algorithm to compute a nearly optimal solution to the packing LP (see Appendix [B] for a formal 
statement). When a variable x^^a or f^p^/j is specified as a query, we locally simulate the distributed 
algorithm and output the value for it. The only issue is that BasicLP is not a packing LP. In 
this section, we transform BasicLP to a packing LP, and we will show that we can restore a good 
approximation to BasicLP from an approximation to the resulting packing LP. First, we substitute 
Xv,a by 1 — x^^a and relax each equality constraint by e. Then, we obtain the following LP. 

max E P{l3)tJ-p,f} 

s.t. I x^,a-{q-l)\ <e yvGV 

\xv,a + E fip^p-l\<e yPeV,ve V{P),a G [q] 

Xv^a > \fv eV,ae[q] 

t^P,^>0 VPgP,/3g 

Lemma 3.2. Let I be a A-CSP instance and (x,^) be an e-infeasible solution to LP ([T]) of value 
cwx- Then, lp(X) > c — e • poly((7s) holds. 



Proof. Clearly, (1 — x,^i) is an 2e-infeasible solution to BasicLP of value cwx- From Lemma 12.51 
the lemma holds. □ 

Next, to make the directions of the inequalities the same, we introduce a complement variable 
for each variable, i.e., we define Xv,a = 1 — x^^a and flp^p = 1 — ^lp^p■ However, such equality 
constraints cannot be used in a packing LP. Thus, we relax those equality constraints again. That 
is, we introduce constraints of the form x^^a + Xv,a < 1 and /Xp^^ + Mp,/3 < 1- Instead, to discourage 
them to become much smaller than one, we add additional terms to the objective function. By 
letting C = (C, C, . . . , C) where C = g^(*)poly(tu;)/e2 is a large constant, we get the following LP. 

max Y wp E P{|3)^lpp + C'^{x + x) + C'^{^l + -p), 

P€V fSG[q]V(P) 

s.t. Y ^v,a <^J' — 1 + e \/v 

ae[q] 

Xv,a+ E /^P/3<l + e yP £V,v £ViP),a£[q] 

Xv,a+ E jLp,^ < q^^^''^^-^ + e yP£V,veV{P),a€[q] 

Xv,a + Xy^a < 1, Xy^a > 0, Xy,a > eV,a € [q] 

f^P,IS + Tip,p < 1, /^P,/3 > 0, 7Ip,/3 > VP G /3 G [q]^(P\ 

Fortunately, the optimal solutions to LP ([2]) and LP ([1]) are essentially the same. 

Lemma 3.3 (Theorem 7 of [8J, in a special form). Let X be a A-CSP instance and {x*,x*, ^* ,Jl*) 
be the optimal solution to LP ([2]) with value cwx + ON where N is the number of variables in 
LP ([2]). Then, x* -\-x* = 1 and /x* +71* = 1 hold. Also, (x*,fj,*) is the optimal solution to LP ^^ 
with value cwx- □ 
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Now, using the distributed algorithm given by |19j . we have the following lemma. The analysis 
of the query complexity is tedious and the proof is given in Appendix [Bl 

Lemma 3.4. In the bounded- degree model, given a A-CSP instance I, for any e > 0, we can con- 
struct an oracle that serves an access to {x,x, fj.,Ji), which is a feasible (1 — €,0)- approximate solu- 
tion to LP ([2]). For each query, the number of queries performed to Ox is at most exp(poly(gstii;/e)). 

Proof of Theorem \3.1\ Let {x,x, be a feasible (1 — e', 0)-approximate solution obtained by 

Lemma [3.41 where e' is a constant determined later. For notational simplicity, we write the objective 
function as w^/x + C'^{z + z) where z = (a;,/x). Let {z*,z*) be the optimal solution to LP ([2]). 
From Lemma 13.31 z* -\-z* = 1. Also, let N < qn + q^ ■ tn = (q + tq'^)n be the number of variables 
in LP ([2]). Then, we have 

w^fx + C^{z + z) > (1 - e'){w'^n* + C^{z* + z*)) = (1 - e'){w'^ ti* + CN). 

Thus, 

C^{z + z) > (1 - e')CN + (1 - e')w^Ai* - w'^ ^ > (1 - e')CN -wx>{l-e'- w/C)CN, 

w^fx > (1 - €')w^ti* + (1 - e'){CN - C^{z + z)) - e'C"^{z + z)>{l- €')w^ fj,* - e'CN. 

In the former inequality, we used the fact that wx < wN . In the latter inequality, we used the fact 
that CN >C^{z + z). 

Prom the former inequality, we have 

1^(1 -z-z) < {e' + w/C)N. 

Let S be the set of variables z, (= x^^a or fJ^p^f^) such that (1 — Zj — zj) > e" where e" is a 
constant determined later. From Markov's inequality, we have \S\ < (e' + w/C)/e" ■ N. Let 
Sx = S n {xv,a}v&v,aG[q] and 3^ = 3(1 {/2p^/3}pgp^^g[g]v(p). The variables in 3^ and 3^ are 
problematic since constraints in LP ([2|) involving them are far from being satisfied. Thus, in what 
follows, we modify these variables and obtain nearly feasible solution to LP 

First, we construct variables {Xy^a}y^v,a£[q] by setting x'^^^ = x^^a if none of {xv^a'}a'G[q] 
3x and ,j = 1/q if otherwise. Then, we construct variables {f^-'p /s} p^v{P) /9e[g]^(^) as follows. If 
none of {xv,a}v€V{P),a£[q] ^as modified in the previous step, we set /^pa = fJ-p^p- If otherwise, we 
set the values of {/^p/3}^g[g]v{p) in such a way that the distribution Hp becomes consistent with 
the product distribution determined by {x'^}y^p. Note that each modification to x in the previous 
step involves at most 2tq'^ modifications to fx. 

We calculate the decrease of the objective function. The decrease caused by the modification 
to x^^a is at most Ylp^v ""^-P ^ and the decrease caused by the modification to fip^^ is at most 
wp < w. Thus, the total decrease is at most tw\3\ + 2ti(;g*|5| < (e' + w/Q/e" ■ (1 + Iq^^wN . 

Note that for each unmodified variable z, (= Xy^a or fip^p), z[+'z[ > 1 — e" holds. Thus, 
{x' ,x' , n' ,fl') is an e"-infeasible solution with value at least 

w'^fx' > (1 - e')w'^n* - e'CN - (e' + w/C)/e" ■ (1 + 2q')twN 

> (1 - €')w^tx* - {e'C + {(' + w/C)/e" ■ (1 + 2q')tw){q + tq')n. 

Thus, {x', n') is an e"-infeasible (1 - e', (e'C + (e' + w/C)/e" ■ (1 + 2q^)tw){q + tg'')n)-approximate 
solution. By choosing e' = e^/(g*^'''*^poly(tu))) and e" = e, we have an e-infeasible (1 — e,en)- 
approximate solution. 

We need to look at q variables {xy^a}a£[q] to decide the value of x'^ ^^, and we need to look at 
at most qs variables {xv^a}vev(P),a&[q] to decide the value of /^p^. Thus, the number of queries 
performed to Ox is at most max((;, qs) exp(poly(gsit(;/e')) = exp(exp(poly(gsiw/e))). □ 
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4 Optimal Rounding of BasicLP 



In this section, using Oip, we give an algorithm described in Theorem ll.il Let I = {V,V,w) be a 
A-CSP instance. For a mapping cj) : V ^ V, we define a new A-CSP instance X/</> = {V ,V' ,w') 
on the variable set V by identifying variables of X that get mapped to the same variable in V' . For 
each constraint P £ V on the variable set {vi, . . . , v^} with weight wp, we have a constraint P' S V' 
on the variable set {(p{vi), . . . , '/'(vfc)} with weight wp. For x G [0, 1], we define = {k-\- l)e where 
A; is the positive integer such that ke < x < (A; + l)e. We define = when x = 0. In what follows, 
we assume that 1/e is an integer. If not, we slightly decrease e until 1/e become an integer. Let 
(a;,/x) be an LP solution for I. We identify variables v oi I that have the same values {x^ ^}a£[q]- 
Formally, we consider another A-CSP instance I/4>x where (px ■ V ^ {0, . . . , 1/e}'' is defined as 
0a; (^) = {^l 1, • • • , xl g). We have following two lemmas, the proofs of which are in Appendix O 

Lemma 4.1. Let I be a A-CSP instance and {x,fi) be an e-infeasible LP solution for I. Then, 
{x^, fi) is a {q + l)e-infeasible LP solution for X. 

Lemma 4.2. Let X be a A-CSP instance and {x,fj,) be an e-infeasible (1 — e, e)- approximate LP 
solution for X, where e > is a small constant. Then, the variable folding X/4>x satisfies that 

• lp(X/(/)a;) > lp(X) - e • poly(g'st'u;)n, 

• The variable set ofX/(j)x has a cardinality exp(poly(g/e)). 

Proof of Theorem \l.l\ Let e' be a constant determined later and {x, fi) be an e'-infeasible (1 — e', e')- 
approximate solution for X. Consider a folded instance X' = X/(f)x on the variable set V := 
{0, . . . , 1/e}''. Since there are at most exp(poly(g/e')) variables in V', there are at most N := 
exp(exp(poly(g/e'))) assignments to V'. For each assignment /3' G [q]^ , we estimate the value 
val(X',/3') as follows. First, we note that (3' can be unfolded to an assignment (3 £ [q]^ to X with 
the same value. Then, for each variable v £ V, we associate a value = J2p^v '^(f^)/\^\- 
clear that < fv ^ tw and Ylv^v ~ ^^K-^^t^) — val(X',/3'). Also, we can calculate the value 
fv by querying Oip at most qst times. Thus, using the algorithm given in Lemma 12.41 we get a 
(1, en/2)-approximation to val(X',/3') with probability at least 1 — 1/3A^ by querying Oip at most 
poly {q stw / e)0 {log N) times. 

By the union bound, with probability at least 2/3, we obtain a (1, en/2)-approximation to 
val(X',/3') for every assignment /?'. Let G [q]^ be the assignment that takes the maximum 
value among those assignments. Then, val(X',;8'*) is a (1, en/2)-approximation to opt(X'). The 
number of queries performed to Oip is at most poly((?stw;/e)0(A^ log A^). 

Let (3* G [q]^ be the unfolded assignment of /?'*. We can safely assume that wx' = wx > en. If 
not, val(X, /?*) is indeed a (1, en)-approximation to opt(X). When wx' = wx > en, it holds that 



We are done by setting e' = e'^ /polj{qstw). The number of queries performed to Ox is at most 
poly(gstti;/e)0(A^log A^) • exp(exp(poly(gstti;/e'))) = exp(exp(poly(gsttt;/e))). Once we have fixed 
/3*, given a variable v £ V, we can compute /3* by accessing Oip q times. The query complexity is 



val(X,/3*) > opt(X')-y 



> SA{lpiX'))wx - y 




(using Wx > en) 



(using Lemma l4.2p 



at most exp(exp(poly(gsiu;/e))). 



□ 
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P={u,v) P^*(P) vXAP) 

Figure 1: Construction of P^^(P) and Here, the alphabet size q = 2, and we choose 

N = 5 and T = 1. Also, Atpoo = 0-^' /^poi = /^Pio = 0-^' ^^"^ /^Pii = 0. It follows that 
^1,0 = 0-6, ^E*^^ = 0.4, a;* = 0.8, and x^ i = 0.2. White (resp., black) variables in V^^j,{P) 
indicate that they are assigned to (resp., 1). 

5 Lower Bounds 

In this section, we prove Theorem 11.21 As we described in the introduction, we utilize Yao's minimax 
principle. That is, we construct two distributions of instances such that they have much different 
optimal values and also it is hard to distinguish them in constant time. We fix a A-CSP instance 
X = {V,V,w) with the optimal LP solution (x*,^*) throughout this section. To convert the LP 
integrality gap opt(X)/lp(X) of X to hardness results, we construct two distributions 'D'^!^ and 'D^^j, 
using X and {x*,fj.*). Here, and T will determine the number of variables and the maximum 
degree of instances generated by 'D'^!^ and T^^^j,, respectively. We show that, by taking T as a large 
constant (independent of A^), almost all instances J in "D^^ satisfy that opt(J') < opt(X) + e. 
Also, we show that all instances in 'D^^j, satisfy that opt{J') > lp(X). Finally, we define X'tvt 

as the distribution that chooses X'^-p or X'J^j^ randomly and outputs an instance generated by 
the chosen distribution. Then, given an oracle access Oj to an instance J generated by X)^j,, a 

deterministic algorithm is supposed to guess the original distribution {V^^ or X'j^j^) of with 

probability at least 2/3. By showing that such an algorithm requires ^}{^/N) queries, we conclude 
that any randomized algorithm that, given an instance distinguishes the case opt{J') > lp(X) 
from the case opt{J') < opt(X) + e requires f](\/iV) queries. By choosing as X an instance with 
the worst integrality gap, we have the desired result. 

Construction of T)^^: Before stating the construction of X'^P^, we introduce a distribution 

V'^^P) for a constraint P £ V applied to a sequence of variables {vi, . . . (see Fig. [1|). An 

instance J'p of V'^^P) is generated as follows. The variable set of J'p is {vi, . . . ,Vk} x [A^]. We 
naturally regard that the set of variables Vi = {(vj, j) | j G [A^]} corresponds to a variable Vi. Next, 
we create TA^ constraints among those variables. To this end, after splitting each variable of J7p 
into T copies, we take random perfect /c-partite matching in such a way that each matching takes 
one variable from each V^. For each such matching {ui, . . . , Uk} where Ui is of the form {vi,ji), we 
create a constraint P{ui, . . . ,Uk) of weight wp. Finally, we merge the split variables. 

We define the distribution using P^^^(P). An instance J' of is generated as follows. 

For each P € "P, we create an instance J7p according to the distribution X'^^(P). Then, J is 
a union of {Jp}pev obtained by merging variable sets as follows. Let Pi,. . . ,Pe G V be the set 
of constraints containing a variable v £ V. We let Vi{i G [i]) denote the set of variables in ^Tp. 
corresponding to v. Then, we take random perfect ^-partite matching among Vi, . . . , V^, and we 
merge variables in each matching. We repeat the same process for every v £ V. We note that the 
variable set of JT" is F x [A^], and the number of constraints in J' is ["PlTA^. Now, we decide the 
indices of constraints, which are used as arguments of the oracle access Oj. We use the following 
rule. Suppose that P is the i-th constraint where v £ P appears (in the sense of X), then for 
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a variable {v,j){j G [N]), we randomly assign T indices {(T — l)i + l,...,Ti} to designate T 
constraints made by J^p. Finally, labels of vertices are randomly permuted. 

Construction of I^j^T' Before stating the construction of ^)vT' s-g^-iii we introduce another 
distribution I?jyy(P) for a constraint P G "P applied to a sequence of variables {vi, . . . ,Vk}- (see 
Fig. [1]). An instance is generated as follows. The variable set of J'p is {vi, . . . ,Vk} x [N]. We 
naturally regard that the set of variables Vi. = {{vi,j) \ j S [A^]} corresponds to a variable Vi. For 
each /3 G [9]^*-^^ we take a /^p ^-fraction of variables from each Vi, and let T/j^^ C Vi denote the set 
of such variables. Variables in Vi^js are said to be assigned to /3y. G [q]- A subtlety here is that fJ-p^pN 
may not be an integer. We ignore this issue for simplicity since we can make the error arbitrarily 
small by choosing A^ large enough. Next, we create fipj^TN constraints among Vi^f^, . . . , V^^^. To 
this end, after splitting each variable into T copies, we take random perfect /c-partite matching in 
such a way that each matching takes one variable from each Vi^jj. For each matching {ui, . . . ,Uk} 
where Ui is of the form {vi,ji), we create a constraint P{ui, . . . , u^) of weight wp. Finally, we merge 
the split variables again. We note that, for any variable Vi £ V{P), an x*. ^-fraction of variables of 
Vi is assigned to a. 

We define the distribution "D^^j, using ^^^^^{P). An instance of I'Jyy is generated as follows. 

For each P G we create an instance J^p according to the distribution V^^rpi^P). Then, ^ is a 
union of {Jp}pev obtained by merging variable sets as follows. Let Pi, . . . , G 7^ be the set of 
constraints containing a variable v ^V. We let Vi^a{i G [^],a G [q]) denote the set of variables in 
Jp^ that correspond to v and are assigned to a. Note that the sizes of Vi^a{i S are the same 
from the construction of V^^ j.{Pi). We take random perfect f-partite matching among Vi^a, ■ ■ ■ , V?,a 
and we merge vertices in each matching. We repeat the same process for every v £V and a G [q]. 
Note that the variable set of ^7 is ^ x [A^] and the number of constraints in J is I^ITA^. To decide 
the indices of constraints and labels of vertices, we use the same rule as ^jvr- 
We have the following three lemmas, the proofs of which are in Appendix [Pl 

Lemma 5.1. For every e > 0, there is aT > satisfying the following. Let J' be a A-CSP instance 
generated by 'D'^^. With probability 1 — o(l), opt(J') < opt(X) + e. 

Lemma 5.2. Let J be a A-CSP instance generated by D^^j,. Then, opt(^7) > lp(X) holds. 

Lemma 5.3. In the bounded- degree model, any deterministic algorithm that, given an oracle access 
to Oj generated by D'^rp, correctly guesses the original distribution of J' with probability at least 

3/5 requires at least r2(\/iV) queries. 

Proof of Theorem \l.<A Let us fix c G [0, 1] and s = S\{c). Then, there exists a A-CSP instance I 
such that lp(X) = c and opt(X) is arbitrarily close to s. Suppose that there exists a deterministic 
algorithm A with query complexity o{^/n) that, given an instance ^7 of n variables, with probability 
at least 2/3, distinguishes the case opt(J') > cwj from the case opt( J") < (s + e)w j — en. Let T 
be a constant given by Lemma l5. II by replacing e with e/2. 

Suppose that J is generated by 1)^^. Then, from Lemma [5. 11 with probability at least 1 — o(l), 
it holds that opt(j7) < (s + e/2)wj = (s + e)wj — ewj/2 < (s + e)wj — en. In the last inequality, 
we use the fact that wj/2 > wj/T > n when e is small. Suppose that J is generated by ^?)vr- 
Then, from Lemma 15.21 it holds that opt(^7) > cwj. 

Thus, in total, the algorithm outputs the correct answer with probability at least 1/2 • (1 — 
0(1)) • 2/3 + 1/2 • 2/3 = 2/3 - o(l). This contradicts LemmaEH □ 
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Appendix 



A Robustness of BasicLP 

In this section, we give a proof of Lemma 12.51 Our strategy is transforming (x,^) to a feasible 
solution without decreasing the LP value much. In the first step, we construct x' from x that 
satisfies X^ag^g] x'^^^ = 1 for every v £ V. 

Lemma A.l. Let {x,fj,) be an e-infeasible LP solution for a A-CSP instance X where e > is a 
small constant. Then, x can be transformed to x' so that 

Y,<a = 1 yv^V, (3) 

\x'^,a - Xv,a\ < 2e yv£V,a£ [q]. (4) 

In particular, {x',fj,) is a ^e-infeasible LP solution that satisfies Yl,ae[q\ ^v,a — ^ f^'^ every v . 

Proof. We define x'^ ^ = x^^a/ Ylae[q\ ^v,a- The condition ([3]) clearly holds. From the e-infeasibility 
of X, I J2ae[q] ^""'i^ — 1| < e holds. It follows that \x'^^^ — x^^al < — e) < 2e when e is small. □ 

In the second step, we construct fi' that satisfies X^^g[g]v(p) ^ /Xp = x'^ ^ for all P G V,v £ 
V{P). 

Lemma A. 2. Let {x, /i) be an e-infeasible solution for a A-CSP instance I satisfying X^^^j^j Xy^a = 
1 for every v €z V. Then, fj. can be transformed to /x' so that 

Pr = a] = {l-6)x,,a + - ^P £ V,v £ V{P),a £ [q], 
Il/^P-Atp||i < 25 \/P£V. 

where 6 = kq^e. 

Proof. Let us fix a predicate P £ V and S = V{P). We may assume S = {1, . . . ,k} where k < s. 
We can think of function / : [g]'^ — )• M such that is the probability of the assignment 

j3 under the distribution /Xp. 

Let xi, • • • , Xq be an orthonormal basis of the vector space {/ : [q\ — )■ M} such that xi = 1- Here, 
orthonormal means that Ea^[q]{xi{a)Xi{0')\ = ^ij for all hJ ^ W\ where 5 is Kronecker's delta. By 
tensoring this basis, we obtain the orthonormal basis {Xp] pe[q]'' '^^ vector space {/ : [q]^ — )• M}. 
That is, for p £ [qf,[5 £ [qf , we have Xp(/3) = XpAl^i) ' ' ' Xp^iA)- For a function / : [q]'' R, 
we define f{a) = E/3G[g]'= /(^)x<t(/3)- Note that /(/3) = E^^[,j]k[f{a)xa{/3)]. Therefore, if we let / 
again be the function corresponding to /ip, we have 

Pr [(3i = a]= E 

Here, fi{s) = f{cr) where ai = s and ar = ^ for all r £ [k]\ {i}. In the second inequality, we used 
that for every a with cr^ 7^ 1 for some r £ [k] \ {i}, the sum over the values of Xa vanishes. 

We let gi : [g] — )• M be the function gi{a) = Xi^a- We define a function /' : [q]^ — )• M as follows. 





= E 


lM)Xa{a) 




<TG[g] 





gi{s) if (Tj = s and ar = 1 for all r £ [k]\ {i}, 
f{cr) otherwise. 
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This is well-defined since for any i G [k], it holds that gi{l) = X]ae[g] ^^i(^) ~ Z]ae[g] ^i." ~ 
Therefore, the function /' satisfies Ylf5e[q]'' f'((^) ~ f(^) ~ Then, we can define a distribution 
n'p corresponding to /', and we have 



Pr [ft 



E 

ae[q] 



X 



Thus, it looks that the /Xp is the desired distribution. However, in general, the function /' might 
take negative values. We will show that these values cannot be too negative and that the function 
can be made to a proper distribution by smoothing. 

Let K be an upper bound on the values of the functions xi-, ■ ■ ■ iXq- From the orthonormality 
of the functions, it follows that K < y/q. Let fi{a) = Pr/3^^^[/3i = a]. Since the LP solution (x, fi) 
is e-infeasible, we have 



- Ms) 



^ 9i{a)Xs{a) - ^ fi{a)Xs{c 
ae[g] 



ae[q_ 



< Kqe. 



Therefore, \f'{a) - f{a)\ < Kqe for all a £ [qf. Recah that \ f'{a) - f{a)\ = for cj G [qf if there 
are i ^ j such that fij 7^ 1, o",- 7^ 1. Thus, 



im-/(/3)i 



E 

ae[q]'<: 



na)xa{l3) - f{a)xA/3) 



< S/q\ 



(5) 

is the uniform 



where 5 = K'^kq^e. Hence, if we let h = {1 — S)f' + 5U, where U : [g]^ — 
distribution U = l/q^, then 

h(x) = (1 - 5)f'{x) + 6/q^ > (1 - 5)f{x) > 0. 

It follows that h corresponds to another distribution /ip over assignments [q]^. Furthermore, it 
holds 

Pr [ft = a] = {l-6)xi^a + -- 
p^n'p q 

Finally, let us estimate the statistical distance between the distributions /ip and /Xp. 

11/ - /illi = 11(1 - 5){f - f) + 5if - U)\\i <\\f-f'\\i+6< 25. 

The first inequality is from the triangle inequality and the second inequality is from ([5]). □ 

Proof of Lemma \2.5[ Let us consider an e-infeasible LP solution {x,fj,) for a A-CSP instance I of 
value cwx- First, we construct vector x' as in Lemma lA.li These variables together with the 
original local distributions /x form an 3e-infeasible LP solution for I. Next, we construct local 
distributions /x' as in Lemma lA.21 Define new variables 

x'l^ = {l-5)x[^^ + 5/q. 
It follows that {x" , fj.') is a feasible LP solution for X. The LP value of this solution is 
^wp EjP(/3)] = cwx-Y,^P Yl ^(^) 

> cwx - wp\\fj,p - /xplli 

P€V 

> cwx — e ■ poly{kq)wx- 

We used |-P(x)| < 1 for the first inequality, and the second inequality follows from Lemma I A. 21 □ 



14 



B Proof of Lemma 13.4 



In this section, we give a proof of Lemma [3.4[ We consider a more restricted form of a packing LP: 

max I'^z 

s.t. A^z < c (6) 
z>0, 

where A E R™^"" is a non- negative matrix such that aji = or aji > 1 for any j £ [m\,i G [n], and 
c G is a non-negative vector. 
Define 

m n 

Cmax = maxci, Tp = max > aji, L^ = max > a,,. 

* i=l i=\ 

Then, there is a distributed algorithm that solves this packing LP. 

Lemma B.l ([E]). For sufficiently small e > 0, there exists a deterministic distributed algorithm 
that computes a feasible {1 — e, 0) -approximate solution to LP ([6]) in 0(log Fp log F^/e^) rounds. □ 

In order to apply Lemma iB.ll to LP we transform it to the form LP Note that, in the 
objective function, the coefficient of /ip_^ is wpP{f3) + C and the coefficients of Xy^a-,Xv,a,V-p,i3 
are C. Thus, by replacing /^p^^ with fip^^/{wpP{l3) + C) and replacing Xy^a,Xv,a,Jtp^i3 with 
Xv,a/C,Xy^a/C,Jip^^/C, respectively, we obtain the following LP. 



max 1 {x + X + fj. + fi) 



s.t. E^<9-l + e yveV 



a&[q] _ 



a£[q] 

^+ E <l + g '^PGV,vGV{P),ae[q] 

/3e[<?]^"(P),/3„=a 

^ + ^<_1, X.,,a>0, X.,^a>0 yveV 

_^J^ + ^<1, ;,^^^>o, jZp^^>0 VPGP,/3eM^(^). 

We multiply each constraint in order to make every coefficient in the LHS at least 1. Then, we 
have the following LP. 

max l'^{x + x + 

s.t. E Xv,a <C{q-l + e) yveV 
E ^v,a < C(l + e) yvev 

aG[<j] 

'^^v,a+ E ^,ViB)+i t^P,P<i'^ + ^)i^ + C) yPGV,vGViP),ae[q] (7) 

/3e[g]^(^')A=a 

Xv,a + E 7^P /3 < + e) VP G G y (P) , a G [^7] 

3^1), a + X,i]^a ^ C*) 3;^^Q, > 0, X^^a ^ Vf G Y 

^,ptg+i /^P,/3 + ^Mp,/3 <w + C, Mp,^>0, 7Ip,^>0 VPGP,/3G[g]^(^). 
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Proof of Lemma \3.4\ Note that LP ([7]) is of the form LP ([6]). After a calculation, we have 

c^^. = 0{C{w + q')), Tp = 0{{s + t){w + C)-C{w + q')), = 0{{w + C)q'). 

We define the degree of a variable in an LP as the number of inequalities where the variable appears. 
Let Ap and be the maximum degree of primal variables and dual variables, respectively. Here, 
we treat LP ([7]) as a dual formulation. We have 

Ap = 0{q'), Ad = 0{s + t). 

Applying the algorithm given in Lemma iB.ll to LP ([7]), we obtain a distributed algorithm that 
calculates (1 — e, 0)-approximate solution. The number of rounds is 0(log Fp log Frf/e^). Note that, 
given a variable, we can simulate the computation of the distributed algorithm involved by the 
variable with (ApA^)'' queries, where r is the number of rounds. Thus, the query complexity 
becomes 

(ApArf)^('°s'^f = exp(poly(gstw;/e)). 

□ 

C Proofs from Section |4] 
C.l Proof of Lemma 14.11 

Proof. Since we move each x^^a by at most e, each constraint X]ag[g] ^v,a — 1 can be at most {q+l)e- 
infeasible. Also, each constraint J2i3e[q]^'-^'> i3^=at^P,i3 ~ ^v,a can be at most 2e-infeasible. □ 

C . 2 Proof of Lemma 1472] 

Proof. Since the size of the range of is (l/e)'^^''^, the second claim is obvious. 

Suppose that (x, /x) has an LP value cwx- From the fact that {x, fi) is a (1 — e, e)-approximate 
solution, we have cwj > (1 — e)lp(X) — en. Also, by Lemma |4.H {x^,^) is a (g + l)e-infeasible LP 
solution. Since only /i affects the value of the objective function, the LP value of (x'',fi) equals 
cwx- A key observation is that (a;^,/x) is also an LP solution for the folded instance I/4>x. Thus, 
we see that I/(px has a (g + l)e-infeasible solution of value at least cwx- From Lemma [2.51 we have 

lp(X/0a;) > {c - {q + l)e ■ poly{qs))wx 

> (1 — e)lp(X) — en — e ■ poly{qs)wx 

> lp(X) — e • poly{qstw)n. 

In the last inequality, we use the fact that lp(X) < wx < twn. □ 

D Proofs from Section [5] 

D. l Proof of Lemma 15.11 

Let be an instance generated by Let Pj(l < i < 2) be a constraint on a variable 

sequence {u\, . . . , u^} in J'p. Note that the arities of Pi are the same since they both are copies 
of P. For each j G [k], we choose fj G {u^uj} arbitrarily and Vj be the remaining one, i.e., 
{UjjUj} \ {Vj}. Then, we define a constraint Qi{l < i < 2) on the variable sequence {v\, . . . ,vl}. 
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We create another instance J'p from J'p by replacing {Pi,P2} by {Qi,Q2}- We call this method 
switching. The following concentration bound is obtained by a simple application of Theorem 2.19 
in t28j. 

Lemma D.l. If ^ is a random variable defined on P^^(P) such that |X(Jp) — X(i7p)| < c holds 
where Jp and J'p are instances of'D^^{P) that only differ by a switching, then 

Pr [\X{Jp) - E[MJp)]\ >t]< 2exp (-7f^) 

for all t>0. □ 

Proof of Lemma \5.1[ Let a G [(/J^^l^] be an assignment to J. For v &V and a G [g], we define 
Xv,a = #{i e [N] I a(„^i) = a}/Af. Also, for P G P and /3 G we define /Xp_^ = n,;GP ^^'./S^. • 

Note that Xy (resp., ^p) gives a probability distribution over assignments to the variable v (resp., 
the variable set V{P)). 

Let J'p be the sub-instance of J generated by 'D^^{P) for P G "P. The expectation (over 
P^P^(P)) of the value gained by a constraint P in Jp is Eppr^f^p [P(/3p)]. Thus, it holds that 

E [va\{J,a)] = y] E [val( Jp, a|y(p))] = TiV V i(;p E [P(/3p)] 

= TiV J^^p^E [P(/3|v(P))] = riV/ [Y^wpP{(3\y^p))]=TN E [val(X,/3)] 
Thus, it follows that 



Pep /^-/^ /^-/^ pgp /^-/^ 



E [val(J,a)] <TiVopt(X). (8) 

Note that, for instances Jp and J'p generated by P^*'^(P) such that they differ by a switching, 
val(j7p, a|v(p)) and val(i7p, a|v(p)) can differ by at most 2wp < 2w. Then, from Lemma lD.lt 

Pr [|val(Jp,a|y(p)) -E[val(Jp,a|v(P))]| > t] < 2exp (- ^^^^2 ) ■ 

Then, 

Pr[|val(J,a) -E[val(J,a)]| > tiT'l] < Pr [3P G 7^, |val(Jp, a|v(P)) - E[val(Jp, a|y(p))]| > t] 



< 2|P|exp 



The last inequality is from the union bound. 

We choose t = eTN so that t\V\ = e\'P\TN < ew j. We have 

1 / e^TN\ 

val{J,a) - E[val(J,a)]| > ewj < 2\V\ exp — . (9) 



We combine ([8]) and ([9]) with the union bound over all gl^l^ assignments. It holds that 
Pr 3a, val{J, a) > TNopt{I) + ew j = Pr 3a, ^{J, a) > o^{l) + e 

< 2|P|expf-4^1gl^l^. 



4w'^ J 

by choosing T = ©(if^ log g/e^), we have the desired result. Note that \V\ and \V\ can be seen as 
constants when is sufficiently large. □ 
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D.2 Proof of Lemma 15.21 

Proof. Let a G be the natural assignment to variables in J^. That is, a{v,i) = a when the 

variable {v,i) is assigned to the value a in the construction of ^jvy Then, 

opt( J) > val( J, a)=Y^ val{Jp, a|y(p)) = TiV ^ i(;p E ^ [P(/3p)] = TNlp{I). 

Pe-p Per /^p~i^p 

□ 

D.3 Proof of Lemma 15.31 

For notational simplicity, we omit subscripts N and T in this section. We define some notions. At 
each step of an algorithm, a variable v is called seen if v is appeared in queries to the oracle or 
answers by the oracle so far. Also, an index z of a variable v is called seen if the i-th constraint of 
V is already returned by the oracle. 

Here, we only show a lower bound for a (randomized) algorithm whose behavior is slightly 
restricted. That is, when an algorithms asks for a constraint incident to an unseen variable, 
we assume that the algorithm chooses the variable uniformly at random from the set of unseen 
variables. We can get rid of this assumption using the technique presented in Section 4 of [12J. 
Details are deferred to the full version of the paper. In what follows, we regard that the oracle 
accepts two types of queries. The first one is same as the original, i.e., when we specify a variable v 
and an index i, the oracle returns the i-th constraint of v. The second one simply returns a random 
variable from the set of unseen variables without receiving any argument. When an algorithm asks 
for a constraint incident to an unseen variable, it uses the second type of queries to get a variable 
first, and then it uses the first type of queries to get a constraint incident to the variable. 

Now, we prove Lemma 15.31 Recall that, from Yao's minimax principle, it suffices to consider 
deterministic algorithms. We basically follow the approach presented in Section 7 of |11J. Let 
^ be a deterministic algorithm. We introduce a randomized process V°P^ (resp., -P^P), which 
interacts with A so that 'P°p* (resp., "P'p) answers queries of A to the oracle while constructing 
a random instance from D°p* (resp., D'p). The final distribution of instances generated by "P^p* 
(resp., P^P) coincides with I?°p* (resp., 2?^p) no matter how A makes queries. The interaction 
between A and P°P* (resp.,P^P) precisely simulates the interaction between A and Oj where 
is an instance generated by the distribution D°p* (resp., P^p). The process V*, which corresponds 
to the distribution V*, is simply a process that chooses P°p* or "P^p randomly and behave as the 
chosen process. 

A transcript is the part of an instance that A has seen through the interaction with a randomized 
process. Note that, the transcript contains the information about labels of vertices and indices of 
constraints. Let /Cr^* (resp., /Ci?) be the distribution of transcripts after r-step interaction between 
A and P°P* (resp., P'P) (Here, /C stands for knowledge). The statistical distance between /Cr^* 
and is defined as follows. 

Pr [K' = K]- Pr [K' = K] 

From the argument given in Section 7 of [llj, by showing that dTyiJCr^^ ,}C^) = o(l) when r = 
o{^/N), we have the desired result. 

We can safely assume that A never asks for the same constraint twice or more. Also, we assume 
that, if V* returns a constraint containing a variable in the transcript, A can correctly guess the 



dTv(/cr,^i'') = ;^E 

^ K 
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process ('P^'p* or "P'p) with which A is interacting. In other words, we are assuming that, when 
■popt (^]~ggp_^ "P'P) returns a constraint containing a variable in the transcript, it also returns a 
certificate stating that the current process is 

-popt (resp., T^'P). This only improves the ability of 

A and makes the lower bound smaller. 

Now, we define the randomized process J'°p*. We omit the definition of "P^p as it is very similar 
to the construction of 'P°P*. The process ^"p* has two stages. The first stage proceeds as long as 
A perform queries. In this stage, P°P* chooses an answer for each query. In the second stage, the 
process completes the transcript into an instance J. 

We identify [n] (resp., [nA^]) with the set of variables of I (resp., an instance generated by 
P°P*). Recall that, in an instance generated by I?°p*, the variable set [nN] can be separated into 
n sets, each of which corresponds to a variable i £ [n]. The process 7^°p* incrementally constructs 
this correspondence. A (partial) correspondences is represented by a map p: [nN] — t- [n] U {-L}. 
For a variable i G [n], let Vi = {v £ [nN] | p(v) = i} and Ni = [Vi[. Also, for each vertex v G [nN] 
and an index i S [d], let Di{v) = {j G {(T — l)i + 1, . . . ,Ti} | j-th constraint of v is seen} and 
di{v) = [D^{v)[. 

In the first stage, given a query by an algorithm A, chooses an answer for it as follows. 

• When the query asks for a random unseen variable: we choose a random unseen variable 
V G [nN], and set p{v) = i with probability ^ '^^(^^'Ln .) ■ Then, we return v to A. 

• When the query asks for the p-th constraint of v: Note that p{v) 7^ _L from the assumption 
that, when A asks for a constraint incident to an unseen variable, it asks for a random unseen 
variable beforehand. Let q be such that (T — l)q + 1 < p < Tq, and P be the q-th constraint 
of p{v) in I, which is applied to a sequence of variables {ii, . . . , i^} in X for which i£ = p{v) 
for some i € [k]. Also, let qj be such that P is the qj-th constraint of the variable ij in I. 
Note that qi = q- 

Then, we choose a set of variables as follows. For each variable u with p[u) = ij, 

we choose u as with probability , — im — at—vt- If otherwise, we choose a 

random unused variable u as vj ans set p{u) = ij. 

Let P' be a constraint applied to a sequence {vi, . . . , Vk} of weight wp. Finally, we determine 
indices for each variable Vj{j 7^ i). We choose a random index pj from unused indices in 
{(T — l)qj + 1, . . . , Tqj + 1}, and set P' be as the pj-th. constraint of Vj. Then, we return P' 
as the answer for the query. 

In the second stage of P°p*, the process uniformly selects an instance among all those who are 
consistent with the final transcript. 

Lemma D.2. For every algorithm A, the randomized process "P^P* (resp., V^^) when interacting 
with A, uniformly generates an instance J in D°p* (resp., P'PJ. □ 

Proof. The lemma easily follows by induction on the query complexity of A. The base case is clear 
since if no query is made, then the distribution on instances generated by 7^°p* (or, P'p) is clearly 
uniform. The induction step follows directly from the definition of the process. In particular, the 
distribution on instances resulting from the process switching to the second stage after it answers 
the query is exactly the same as the distribution resulting from the process performing the second 
stage without answering the query. □ 

Proof of Lemma 15.31 Let .4 be a deterministic algorithm. It is convenient to think that labels of 
variables are determined on the fly. That is, V* decides labels of variables from [nN] at the time 
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when the variable appears for the first time in the interaction between an algorithm and V. The 
distribution never change by this modification. Also, we can think that the sequence of labels 
is determined beforehand, and for each time when a new variable appears, a new label for the 
variable is taken from the front of the sequence. Let be the process obtained from by fixing 
the sequence to L It is clear that V* coincides with the process that takes I uniformly at random 
and acts as V^. Let "P^^* (resp., V^^) be the process obtained from P°p* (resp., "P'p) by fixing 
the sequence to I. Then, it suffices to bound the statistical distance between the distribution of 
transcripts when A interacts with V"^^ and the one when A interacts with V^^ for any sequence I. 

A deterministic algorithm A with query complexity r can be expressed as a decision tree of 
depth at most r. Here, each node in the decision tree corresponds to a query to the oracle, and 
each branch from the node corresponds to the answer by the oracle. Recall that, from the rule 
of indices, if we fix an index, the process always returns the same predicate (though the set of 
vertices to which the predicate is applied should differ). Also, since we have fixed the sequence 
of labels at each node in the decision tree, there is just one branch corresponding to the case 
that A finds a constraint such that any variable in the constraint (except the queried variable) 
is not in the transcript. Ignoring branches for which A outputs an answer, the decision tree has 
the property that the number of children of each node is at most one. Thus, A is essentially a 
non-adaptive algorithm. Without loss of generality, we assume that A outputs that the current 
instance is generated by I?°p* after r steps. 

Suppose that the current process is V°^^ and A is asking for a constraint incident to some 
variable in the i-th query. Note that A has seen at most is variables. Then, from the construction 
of "P^P* , the probability that V°^^ returns a variable in the transcript is at most (jy!!^^-^ • s = j^'Lis ■ 

Using the same argument, we can show that, in the i-th query, the probability that V^^ returns a 
variable in the transcript is at most where // is the minimum of {fJ'P^i3}p^'p /3g[q]V(F) except 

0. 

Thus, from the union bound, after r steps, the probability that returns a variable in the 
transcript is at most 



E 



< 



^ IjlN — is jjN — Ts 

Then, the probability that A outputs the correct answer is at most ^jy^^^ + I- make this 
probability at least 3/5, we have to choose r = Note that /i is a positive constant 

independent of A^. □ 



E Proof of Theorem 11.41 

Proof. We show the first part of the theorem. Let A be a CSP such that 5'a(1) = 1 — 7 for some 
7 > 0. Suppose that there exists a testing algorithm for the CSP A with o{\/n) queries. Note that a 
^-far instance I satisfies that opt(X) < wj — < (1 — ■^)'Wx- Thus, using the testing algorithm, 
given an instance I, with probability at least 2/3, we can distinguish the case opt(X) = wx from 
the case opt(X) < (1 — ^)wx- However, instantiating Theorem 11.21 with e = 7/3, the theorem 
asserts that any algorithm that, given an instance X, with probability at least 2/3, distinguishes 
the case opt(X) = wx from the case opt(X) < {S\{1) + ■^)wx = (1 — ^)'Wx requires fl{^yn) queries. 
This is a contradiction. 

We show the second part of the theorem. Let A be a CSP such that S'a(I) = 1. Since Sa{c) 
is continuous at c = 1, for any e > 0, there exists 5 such that S\{1 — 6) > 1 — e/2. Consider the 
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algorithm obtained by instantiating Theorem 11.11 replacing e with min(e/2,(5). Suppose that X is 
a satisfiable instance. Then, we obtain a value x > S\{1 — 5)wx — en/2 > (1 — e/2)wx — en/2. 
Suppose that T is an instance e-far from satisfiability. Then, we obtain a value x < opt(X) < 
wx — etwn < (1 — e/2)wx — etwn/2. Thus, we can test the satisfiability of the CSP A in constant 
time. □ 



Acknowledgements 

The author is grateful to Hiro Ito and Suguru Tamaki for valuable comments on an earlier draft of 
this paper. 



21 



